Skip to content

Conversation

@temporal-nick
Copy link
Contributor

What was changed

The health checks enabled recently are too aggressive for what we want them to do. Keep the main branch available while we re-approach appropriate health checks.

Why?

The original health checks were trying to mark hosts healthy based on the availability of additional capacity on the inbound
or outbound connection. This is in general how we want routing to work when a remote host establishes an inbound mux and when a local host tries to get an outbound connection, but communicating that nuance to GCP/AWS load balancing hasn't been working out, resulting in host brownouts. Just disable the connectivity check for now.

@temporal-nick temporal-nick requested a review from a team as a code owner December 2, 2025 00:40
return false
}
}
// TODO: Rethink health checks. The inbound/outbound traffic availability isn't quite right for a health check
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

can we not check-in commented code?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure, dropped the code

@temporal-nick temporal-nick merged commit a0518c1 into main Dec 2, 2025
5 checks passed
@temporal-nick temporal-nick deleted the nick/releaseprep branch December 2, 2025 18:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants